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DURING THE SCHOOL YEAR 1965-66, GROUPS OF FIRST-GRADE 
PUPILS IN THE LIVONIA SCHOOL SYSTEM, MICHIGAN, WERE PROVIDED 
WITH READING INSTRUCTION MATERIALS FROM EIGHT PUBLISHERS 
REPRESENTING EIGHT APPROACHES AS PART ONE OF A 2-YEAR STUDY. 
THE MATERIALS WERE THE LIFPINCOTT BASIC READING SERIES, THE 
SCIENCE RESEARCH ASSOCIATES BASIC READING SERIES, THE SCOTT 
FORESMAN BASAL READING SIXTIES SERIES, THE GINN BASIC 
READERS, THE MICHIGAN SUCCESSIVE DISCRIMINATION LANGUAGE 
READING PROGRAM, THE STERN STRUCTURAL READING SERIES, THE 
MCGRAW-HILL. (SULLIVAN) PROGRAMED READING MATERIALS, AND THE 
INITIAL TEACHING ALPHABET PROGRAM. PRELIMINARY MEASURES WERE 
COLLECTED ON MENTAL ABILITY, READING READINESS, SOCIOECONOMIC 
STATUS, teacher ESTIMATE OF READING MOTIVATION, INTEREST IN 
READING, DOMINANCE, AND PRESENCE OR ABSENCE OF SPEECH 
PROBLEMS. THE FOLLOWING QUESTIONS WERE CC^SIDERED IN THE 
COMFARISC*N— WHICH METHOD (1) PRODUCES THE HIGHEST LEVEL OF 
ACHIEVEMENT, (2) FROD>JCES HIGHEST LEVEL OF WRITING ABILITY, 

(3) SEEMS BEST SUITED FOR CHILDREN OF HIGH OR LOW PREVIOUS 
INTEREST IN READING, (4) PRODUCES A SPECIAL TYPE OF READING 
ACHIEVEMENT, (5) IS BEST SUITED FOR HIGH OR LOW READINESS 
LEVELS, (6) IS BEST FOR GIRLS OR BOYS, (7) IS BEST FOR 
CHILDREN WITH SPEECH PROBLEMS. A DETAILED DISCUSSION OF THE 
PROCEDURES AND FINDINGS IS GIVEN. THIS PAPER WAS PRESENTED AT 
THE INTERNATIONAL READING ASSOCIATION CONFERENCE (SEATTLE, 

MAY 4-6, 1967). (RH) 
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AH AHALTSIS OF LlGffS DIFPEREHT BEADING INSTRUCTIONAL METHODS USED WITH 

kBST GRADE STUDENTS 

Session XI * Research In the Pedagogy of Reading 

School districts, in ipproachlng the problem of selecting mater- 
ials on Instructional methods for the teaching of reading, are 
frequently bewildered by the varying claims of publishers and authors 
of such programs* It Is nearly always possible to find In the 
literature evidence which supports the success of eadi program with 
some grotq> of children somewhere* The question for the school dlst* 
trlct then becomes one of evaluating these claims and reported 
successes in terms of the particular set of conditions which operate 
on the local scene* 

Systematic study of the output of the various programs, given 
the local conditions. Is a task which Is too often neglected •• Such a 
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atu^ la not basic research in the traditional sense, althou^ the 
replication of many local efforts across the country mtsf veil provide 
basic knovledge about the effectiveness of various kinds of prograstf* 

It is, nevertheless, a very in^ortant and even critical activity in 
which local school districts should engage. 

During the 1965-66 school year., grot^s of first grade children in 
the Livonia sdiool system were provided with reading instruction nater- 
ial from eight different publishers, representing eight different 
approaches or materials, as the first part of a two-year study* The 
materials were: Lipplncott Basic Beading Series (J^; Science Research 

Associates Basic Beading Series {2) ; Scott Foresman Basal Reading 
Sixties Series <3); Ginn Basic Readers (^; Michigan Successive Dis- 
crimination Language Reading Program (6) ; Stem Structural Reading 
Series (7>; McGraw-Hill (Sullivan) Programed Reading Materials (8); 
and the Initial Teaching Alphabet <l/t/a) (9) program. Preliminary 
measures were collected on mental ability, reading readiness, socio— 
eccmomic status, teacher estimate of reading motivation, interest in 
reading, dominance, and presence or absence of speech problems* 

Materials were assigned by an essentially random procedure among 
those teachers who were willing to volunteer for new reading materials* 
The Ginn Basic Readers material served as a control, with an a<lded 
advantage in that it, too, was a new adoption for the 1965-66 school 
year. Thus, all of the matttrlals were essentially new to the people 
who were using then in the classroom, although, in each case,, some 
previous familiarization had taken place before actual teaching began. 

The assu^>tlon of equivalence among groups was tested by means of 
an analysis of variance of the pre-measures on mental ability and 
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reading readineaa. No aisnif leant differencea among group# were found 
on mental ability. A difference appeared, however, on reading readi- 
neea. The entire atu^ grotg> was, therefore, divided into aubaeta for 
the purpoae of analyaia. 

The Lippincott and McGraw-Uill materials formed a aid>set with the 
Ginn, while the remaining materlala formed another atibaet overlapping 
Ginn, but not including Lippincott and McGraw-Hill. In other words, 
there were no significant differences between the Ginn control groiq[> 
and the Lippincott and McGraw-Hill groups, nor between the Ginn con- 
trol group and the remaining groups in the stu^. There were, however, 
significant differences between Lippincott, lIcGraw-Hlll and the groups 
using materials such as the Michigan Successive Discrimination Language 
Program. 

After a year's instruction, during which all teachers were pro- 
vided with appropriate consultative service and demonstration help 
^dien needed, a preliminary analysis of reading achieveomnt was made. 

The children were tested using the Stanford Achievement Test, Primary 
Battery I, 1S64 edition. Prior to the analysis of the data, the dif- 
ferences in group sire were moderated somewhat by random sampling from 
the population of children who were working with any given material. 
Test scores were tabulated and the Hartley tests were used to 
assess the validity of the assumption of homogeneity of variance. 

This assumption was sig>ported for the smaller esq;>extmental subset and 
for the larger experimental subset with one exception; a rather large 
divergence in the VJord Study Skills section of the Stanford Achievement 
Test. The data were, however, studied by means of the analysis of 
variance model with the divergence accounted for, where possible, by 
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alternatlv* analyses, further random samples, categorized on the 
basis of variables of interest (i.e., male and female, high and low 
readiness, etc.) were also drawn, were subjected to the sane tesU, 
and were analysed by appropriate statistical methods. Including dis- 
tribution free statistical tests. 

Tentative Conclusions ; 

The coaytarlson of the eight coomercial materials and reading 
instruction methods was undertaken to provide a basiu for answering 
the following questions: 

1. VTbich among the eight methods and materials seem to produce 
the hipest level of reading achievement for children in 
general? 

2. Which among the eight methods and materials seem to produce 
the hipest level of writing ability for children in general? 

3. Are there reading materials and cethods whidi seem to be 
uniquely suited for use in instructing children of high or 
low previous interest in reading? 

Corollary to this question, is it possible to collect accurate 
information about levels of reading Interest? 

4. Are there differences among the materials and methods in 
producing t^es of reading achievitment? 

5. Do groups of children categorized by high and low levels of 
(•adiness perform better with some materials than with others? 

6. Do girls perform better than boys with certain types of 
materials and methods? 

7. Are some materials uniquely suited for teaching children with 
speed! problems? 
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£, Are some i&aterlala and methods better suited for teaching 
children frith nixed dominance patterns? 

The findings support the following conclusions! 

I With respect to question one, no significant F* ratios were found. 
The data, therefore, give us reason to believe that Livonia children, 
in general, have no advantage or disadvantage when they are taught by 
Lippincott linguistic or McGraw-Hill programed reading as opposed to 
Ginn reading haterial. 

In the larger esqperimental subset, however, the groups ranked 

themselves in the following order: 

Xhe 1/t/a grotq> equals or significantly exceeds the Ginn groiq> in 

all five subtests. 

Scott Foresnan Sixties groi^ does likewise, except that the ScOtt 
Foresnan group exceeds the Ginn group In only one subtest. Vocabulary, 
(t • 2.59 df 6,120)* while the i/t/a group ^ceeds Ginn in three: 
Spelling (t - 3.53 d£ 6,120) , Word Study Skills (t • 3.86 df 6,120) ^ 

and Word Reading (t, • 5.61 ^6, 120). 

The group using the Stern material does not differ from the Ginn 
group in Vocabulary, Spelling, or Word Study Skills. The obtained 
means were lower than Ginn in Spelling and V7ord Study Skills, but not 
significantly so. The Stem material is significantly poorer on Par- 
agraph Meaning (t • 2.67 df 6,120) and Word Reading (t • 2.81 df 6,120) 
than the Ginn group. 

The Michigan Successive Discrimination program equals the Ginn 
material on Vocabulary and Word Study Skills. This group docs less 



^Reported jt values are conq^uted using Dunnett*s method for com- 
paring multiple means with a control (10). 
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wall than the Ginn group on Paragraph Meaning (jt * 2*67 ^ 6,120) » 
Speiilng (t • 3.53 df 6,120), and Word Reading (jt - 2,81 df 6,120). 

The SRA group equals the Ginn group in Vocabulary; it does not do 
as veil as the Ginn grotq» in Paragraph Meaning * 2.67 df 6,120), 
Spelling (t • 2.64 ^ 6,120); and Word Reading (jt * 2.81 ^ 6,120). 

The Word Stu^ Skills mean Is lower than the Ginn mean, but the diver"* 
gent nature of the variances makes this result uncertain. There is, 
however, no reason to believe that the SRA. material was as successful 
or more successful than the Ginn material in Word Stu^ Skills. 

It would appear from these data that the i/t/a material is prom* 
enough to be given a further tryout, and that the Ginn and Scott 
Foresman Sixties* materials succeed quite well in Uvonia classrooms . 
The same statement may be made for the Lippincott and McGraw-Hill 
materials, although the latter was judged by the staff as quite dif- 
ficult to aBq>l6y. The other materials do not offer much general 
promise in the Livonia school setting. Some cautions are, however, in 
order. First, previous experience with Stem material seems to 
indicate that the problem in Paragraph Meaning tends to disappear by 
the end of the second grade. Second, the 1/t/a material nee^ to be 
j^^w^y^T^ted et the end of the second grade likewise, since it is 
possible that the nature of the test, printed in 1/t/a rather than 
traditional orthography for the i/t/e group, favored children who had 
been trained by the i/t)^a method. There is little doubts however, 
that a substantial plurality had a very successful first grade exper- 
ience with 1/t/a. Finally, further testing is essential because these 
results nay be generalized only to the conditions operating this year*. 
A follow-up of these children, and a replication of the study with the 
more promising materials is underway. 
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Tha second question concerns quality of written expression. On 
the basis of a significant difference obtained by a non-paraaetric 
analysis of variance of judgments of writing samples, it appears that 
writing ability is best achieved in groups using SRA, i/t/a, Lippin* 
cott, and llc6raw>Hill materials, in that order. Again, followup is 
required to better assess this finding. 

Uith respect to question three, no method of instruction seems to 
be uniquely iq>propriate for high or low interest levels. The data 
collected do, however, siqtport a conclusion that a measure of interest 
in reading can significantly predict a subsequent level of achievement 
after a year's instruction (F • 18.87 1,123). While this finding 

answers a question posed as a corollary in this study rather than a 
direct concern, it does open the possibility for further exploration 
of methods for improving reading instruction. 

Concerning question four, the data stq>port the conclusion that 
the eight materials of the study do vary considerably in the kind of 
reading achievement which they most effectively foster (F • 32.39 
df 4,490). Word Study Skills emerge most consistently strong, and 
Paragraph Meaning is the most consistently weak aspect of reading 
achievement. The specific relation between materials and type of 
achievement is discussed under question one. 

The data support conclusions (S’* 254.4) concerning question five 
that for Paragraph lleanlng, l/t/a, Ginn> and Scott Foresman materials, 
in that order, produce better achievement in the low readiness groups, 
while no significant differences occur in the high readiness groups. 

The Spelling subtest seems to indicate that in the low group, the l/t/a 
material * 3.90 df 7,62) is associated with the highest success, 
while the Stem material is associated with the lowest success 



i 



Page 8 

Paul B. Caapbell^ et al 



* 5.46 7,62>. Again» no differences appear among the high 

groups* The previously mentioned caution about the i/t/a version of 
the Stanford Achievement Tests applies here* The significant H sta- 
tistics (263 and 251) for the Word Beading atuilysis confirm again the 
differences among materials for all children in the study* No precise 
probability statement about interaction between levels of readiness 
and materials is possible here, but the data seem to indicate that 
Ginn, i/t/a, and Scott Foresman materials are likely to be more produc 
tlve in the low groups, and i/t/a, Scott Foresman, and McGraw-Hill 
materials seem to fill this specification in the high groups* 

Concerning question six, no materials seem to be uniquely suited 
for boys or girls* In addition, an unpredicted finding appears from 
these data; it is that there appears to be no difference between boys 
and girls in reading achievement in the first grade* This is contrary 
to common assumptions* Furthermore, this finding was duplicated for a 
differmt sample of Livonia first graders by a previous study (5)* It 
appears possible that teachers in Livonia in the first grade are at 

least into account some of the.differences between boys and 

girls which effect reading instruction, or else the differences do not 
exist when Instruction begins* 

No material appeaxsd to be uniquely better than the others for 

children when there were speech problems, as question seven asks, nor 
was any difference in general reading performance found between the 
spee^ problem group and the no speech problem group* A slight trend 
in the direction of lower achievement associated with speech problems 
is evident, but it cannot be ruled out as a chance occurrence* 

Concerning dominance patterns, as posed by question eight, no 
material emerges as being uniquely best for children with mixed 
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donlnimce patterns, or conversely, for children with established dom- 
inance patterns.* Some dif*€erences appear among •the means between the 
groiq>s« For exa^[»le, in Paragraph Heaning, Scott Foresman, Stem and 
i/t/a materials seem to he most effective for the mixed dominance 
group, ^diilc Ginn, Scott Foresman, HcGraw-Uill, and i/t/s seen to be 
best for the established dominance groups* Likewise, in the Word Study 
Skills, i/t/a, Scott Foresman, Stem and Lippincott materials ^pear 
to produce the best achievement for the mixed grotq>, idiile for the 

V 

Mtablished group, Scott Foresman, 1/t/a, McGraw-Hill and Stem mater- 
ials seem to do the best job* These occurrences, of course, may be 
ffh anc* orderings, and no firm conclusion should be drawn from them* 
Again, the lack of differences between the overall achievement of the 
mixed and established dominance groups, where it cOuld be tested, 
seems to indicate that these factors are being taken into account 
fairly well by teaching methods in mse in Llvmia schools at the 
present time. 

In considering the total pattern which emerges from this study, 
it appears that the most significant result is the existence of highly 
significant differences among the materials and methods for all 
children, as posed by the first question* 

It should be further recognized that all such findings as these 
are probabilistic; in other words, the results may apply to most 
children, but there will always be exceptions who may profit by an 
approach which does not show as promising general results as the 
others which are included in this study* Teachers should continue to 
try differing approaches when children do not seem to fit with the 
pattern which works best for the total group •• 

o 
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AnalvftiR of Heaiiliig' A^jAi^iynt Anontt Different Katerlali and Instructiotul Approach^ 

The null hypothesis for this part of the study is that there are no significant 
differences anong the ae a n s In reading achievesent for children who have been taught 
using the eight different materials. 

The results for the Ginn> Llppincott and McGraw-Hill subset are shown below 
In Table I. 



TABLE I 

Means and F Ratios for Subset A 





Ginn 


Llnolncott 


McGraw-Hill 


F 


Paragraph Meaning 


20 


19 


19 


.43 


Vocabulary 


22 


22 


26 


1.75 


Spelling 


21 


21 


18 


1.60 


Word Study Skills 


25 


26 


25 


.12 


Word Reading 


20 


20 


20 


.U 


Metro. Reading Readiness 


67 


70 

^5,95 “ 


70 


.99 



Became no significant differences were founds no further analysis of this 
subset atteagited. 

The 61nn» Michigan Successive Discrimination, SEA, Scott Foresman $0s, and 

Stem materials, however, present a different picture. Table II shows the results 

of this analysis. Following the establishment of a significant F ratio for each 

of the five subtests on the Stanford Achievement Test» a cooq>arlson was run between 

the Ginn control group and each of the other material groups. The method used was 

that of a specialized t test using the t distribution published by Dunnett (Winer, 

1962, p 89). Because the sizes of the groups differed, a harmonic mean of ^e 

groups was used with the Mean Square^^ ^ ±a the formula for the t ratio. 

error 

On the Paragraph Meaning subtest, the 1/t/a group and the Scott Foresman 60s 
do not differ significantly from the Ginn control group. The i/t/a mean la one 
point higher and the Scott Forosman 60s mean Is one point lower. The Michigan 
Successive Discrimination group, the SRA Linguistic group, and the Stem group are, 
however, significantly different from the Ginn group, and In a negative direction. 

On the Vocabulary sub test, the Scott Foresman 60s group differs from the Ginn 
group significantly, and in a positive direction, 1/t/a, Michigan Successive Dis- 
crimination, and Stem all equal the Ginn group, while the SRA group shows a non- 
significant positive difference of one point. 

On t^e Spelling subtest, the 1/t/a group differs significantly from the Ginn 
group in a positive direction. The Scott Foresman 60s group also differs In a 
positive direction, but not significantly. The Michigan Successive Discrimination 
group, the SRA group and the Stem group differ In a negative direction, with 
the first two being significant differences. 

The Word Study Skills subtest of the Stanford Achievement Battery shows these 
results: The 1/t/a group differs significantly in a positive direction from the 

Ginn group. The Scott Foresman 60s group differs In a positive direction, but not 



TABLE II 

Cooparison of Six Instructional Approaches and Hatsrials in First Grade Reading Study 
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significantly. The Midiigan Successive Discriuinetion group, the Linquistic 
grotq», end the Stem Structural group all differ in e negative direction. The 
SRA grotq> shows a significant difference. This sub test, however, is the one 
showing a great divergence in variance, and the SRA grotq» has the nost divergent 
variance of any of the groups in the study. To test the effects of this diver 
gence, the SRA groiq> was renoved and the analysis was rerun. The i/t/a group 
renains significantly different in a positive direction, but the significance of 
the difference of the SRA groiq> is unresolved. 



The Word std>test shows the following results; The i/t/a group diffem 

significantly from the Oinn group in a positive direction ^ The Scott Foresnan 60s 
group is equal to the Ginn group. The Michigan Successive Discrimination group, 
the SRA, and the Stem all differ significantly in a negative direction. 

TABLE III 
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Stem 94 
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Analyis of Writing Quality Among Flr»t Grade Reading Groups 

A further anelyeis of the differences in writing quality snong first grade 
reading groiq>s vas made. Results are shown in T^le IV. For this purpose, a 
randoB saaple vas drawn froa a standardised writing assignaent by each aeaber of 
the ei^t different groups. The writing saoples were rated by two judges as 
to quality, and rank ordered from one to 125. A noa*paraaetric one-w^ analysis 
of variance was applied to these data. The results show a significant difference 
among the groups on the quality of composition. Ixispection of the average rank 
orders shows the i/t/a, Lippincott, and McGraw-Hill groups forming one cluster; 
the SRA material stands alone as the single best scorer; and theralhining materials 
cluster at the opposite end of the range of means* It is probable that the SRA 
material contributes most to the variance reported in the significant F ratio. 

The performance of the Lippincott and McGraw'^ill groups may be accounted for, 
in part, because these groups were those highest in readiness at the beginning. 

The i/t/a material is surprising, because this group was one of the two lowest 
in readiness at the beginning of the study. A follow-up at the end of the second 
grade should further clarify the status of writing ability among the several 
reading instruction groups. 



TABLE IV 



Analysis of Difference in Writing Quality 
Among First Grade Reading Study Materials 
(Lowest mean represents highest quality) 



Ginn i/t/a 

N - 14 17 

X « 67.8 51.2 



ULppin- McGraw< 
cott Hill 



12 



8 



44.4 



MSDLP 



8 

93.9 



SRA 


Scott 

Foresman 


Stem 


10 


15 


33 


37.9 


62.5 


76.7 



H 



(1,99) 



21.12 

20.09 



46.6 



leading Anal vlt of Hrgt Crade Sy tcwidc todlug StMdy 

The effect# of Interaction between differing levels of reading interest 
and the several aethods of reading instruction were studied b> aeans of an in- 
terview schedule wfaidi each teadier coopleted individually with ea^\ student. 

The reading interest Interview schedule contained a series of questions and 
probes designed to elicit fro* the child his perception of the level of reading 
interest in his earlier experience and his own reactions to opportunities to 
enjoy books. On the basis of this interview, tiie students were divided into hl^ 
interest and low Interest groups. These two groups were c<»posed of randoa 
sai|>les of ten students from each method at each level. The results of the 
criterion test, the Stanford Achievement Test, for these groups were analysed 
in a factorial design having two levels of interest, seven levels of method, 
and five levels of adiieveaent type. The MSDLP method was not included in 
this analysis because a sufficient sample was not ava il a b le from this grotqt. 

The results of the analysis are shown In Table V (a) and (b). 

The critical values for the F ratios shown in Table V (a) , column 6 ure 
not those which would normally be associated with a design of this ^ 

rsMon £or this Is thstp upon oxswlnstlon of the dstSp the essunptloiis of 
homogeneity of variance and covariance appeared to be doubtful. Therefore, 
the conservative critical values of the F ratio suggested by Box, and by 
Greenhouse and Geisser, (Winer, pages 96 and 305) , were used in evaluating the 
hypotheses of no differences among the grotq>s* 

As indicated on the table, two significant main effects and one Inter- 
action emerged. They are factors B and C and interaction AB. Interpretation 
of these findings can best be considered in the light of the purposes for idiich 
t b* was originally set up. The purposes are reflected in the form of 

the following questions: 

1. Are there differences In achievement among the several methods 
of Instruction? 

The data from this particular sample would answer this question 
in the negative. It should be noted, however, that for the larger 
sample previously analysed in this study, significant differences 
among Mterials were found. 

2. Is there a difference in achievement between high and low Interest 
levels? 

This question is answered affirmatively by the data. The factors 
measured by the reading Interest schedule apparently make a dif- 
ference in the outcome of reading achievement. 

3. Is there an interaction between methods of instruction and interest 
levels? 

For this sample, the answer is again negative. 

4. Is there a difference in types of achievement among the students? 

Or, put another way, are students better in some typos of achieve- 
ment than in others, as measured by the Stanford Achievement Test? 

The answer to this question is affirmative. There is a hi^ly 
significant difference (see Table V (a)). 
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' 5. Is there Interaction between the aetbode anii types of adileveaent? 

Again, this question is answered positively for the saaple under 
study. In other words, this neans that sons types of achieveaent 
are aore likely to occur with some aethods than others. 

6. Is there any interaction between interest level and type of 
adiieveaent? 

Ihe data do not show such an interaction. 

7. And, finally, there is no apparent multiple interaction between 
interest levels, aethods, and types of achievement for the saiQ>le 
in question. 

An exanination of the aeans reported in Table V <b) will show the probable 
location of soae of the significant relationships uncovered by this analysis. 

With respect to the significant difference for levels of Interest, it matf 
be noted from Table V (b), that of the 35 p^iirs of aeans considered across the 
high Interest and low Interest groups, only six show differences in favor of 
the low interest group. Of the six, four ate found in the 1/t/a group, and two 
in the Scott Foresman group. This suggests an interaction between aethod md 
level, but due to the problems with this sample, requiring very conservative F 
tests, a chance occurrence cannot be ruled out. 

With respect to the significant F ratio relating to type of achievement, it 
fluy be noted froa Table V (b) , that the aeans range in size in such a Banner that 
the differences within groups may be as little as four points, and as much as 14 
points in the high interest category, and from a difference of three points to a 
difference of 14 points in the low Interest category. In other words, this means 
that one should expect a considerable diversity in types of achievement regardless 
of interest level. Significant method by type interaction indicates that spas 
methods produce a greater dlversl^ in types of achievement ^an others. The means 
of Table V (b) indicate that 1/t/a might be one of these with a range of 14 for the 
high group, and 12 for the low group; that the Stem aethod might be another in 
the low group with a range of 14; and that Ginn, T.ipplncott, and McGraw-Hill 
aaterials in the high group, likewise, might be expected to produce large differ- 
ences in types of achievement. 

Several interesting possibilities emerge from this analysis. First, it 
appears likely that the Interview schedule used to determine reading interest 
does in fact measure a factor which can significantly predict level of achieve- 
ment for most methods of instruction utilized in this study. Second, the 
several aiaterials or methods seemed to differentiate significantly among the 
types of reading adtlevement, with Word Study Skills emerging as the most 
consistently strong factor, and Paragraph Meaning- as the most consistently 
weak aspect of reading achievement. Finally, the problems encountered in 
analysing this sample, having to do primarily with lack of homogeneity of 
variance, are such that further experimentation is not only indicated for the 
purpose of checking these results, but might bold promise for other significant 
findings which are not supported by these data. 






TABLE V 

Analysis of Variances 



(a) Achievement by Interest^ Hethod, and Type 



Source of Variance 


ss 1 


df 


j IIS 


F 


^5- 95 


Between subjects 


29,965.24 


136 


221.14 






A 


2,999.78 


6 


499.96 


2.93 


5.12 


C 


3,218.08 


1 


3,218.08 


18.87* 


5.12 


AC 


2,865.29 


6 


477.54 


2.80 


5.12 


Subjects vithih groups 
[error (between) ] 


20,882.09 


123 


170.46 






Within subjects 


20,035.80 


546 


36.69 






B 


3,398.38 


4 


849.59 


32.39* 


3.92 


AB 


2,862.36 


24 


119.26 


4.54* 




BC 


157.44 


4 


39.36 


1.50 


3.92 


ABC 


764.63 


24 


31.77 


1.21 


2.17 


B X subjects within 












groups 

[error (between) ] 


12,852.99 


490 


26.23 







A * Hethods *Signl£icant at or above the 5% level* 

B « Achievement types 
C « Interest level 



(b) Mean Adsievement by Interest, Method, and Tjrpe 
(Grade Equivalent Scores) 








Analy^^ff i>**i<-t**^ Arfilevaent Categorized by Rg«dlnc«» Lgvel* and by Haf ri^ 

J y 

Tha coMparisoii MBODg groiq;>s vfal^ Is of IntSYMt in this part of ths sttsfy 
may bast be expressed by the following question: Do groups of children categor- 
ized by levels of readiness perfom better with sone naterials than with others? 

In order to answer this question, the data should be analyzed means of a three- 
factor analysis of variance. Unfortunately, this statistical tool requires cer- 
tain assumptions about the data which could be only partly met by the scores whidi 
were obtained in this stu^. therefore, several less precise alternative methods 
had to be utilized. Specifically, the assumption of homogeneity of covariance was 
avoided by using a two-factor analysis of variance and treating each sub test of 
the Stanford seties separately. The assumption of homogeneity of variance was 
accounted for by two alternatives. One, if only one or two groiq>s were signifi- 
cantly deviant from the remainder of the groups, in terms of this assumption, 
these were dropped from the analysis. Two, if a more general problem seemed 
present, a non-parametric analysis, not requiring homogeneity of variance, was 
applied. The disadvantage of these procedures is that it is not always possible 
to directly answer the question of comparison whidi is posed. Given the conditions 
of each set of test scores, however, the most precise method which could be 
justified was used. The resulting inferences reflect these conditions. 

The results of the analysis of reading achievement by readiness levels and 
by achievement are shown in Table VI (a) . It appears from these data that children 
classified as low In readiness will perform better in Paragraph Heaning with i/t/a, 
Ginn, and Scott Foresman materials, in that order. The other materials tested all 
produced lower average achievement on this sthtest. This conclusion is si^ported 
because no significant differences were found among the means of the high group, 
while highly significant differences appaiur among the means of the low grot^. Ds- 
cause separate analyses of the groups were required, a precise probabillQr statement 
of the interaction between materials and level is not possible with this sub test. 



Concerning the Vocabulary sub test, no differences appear in answer to the 
major question. The two significant results shown merely confirm the results 
of the preliminary analysis on differences among materials for all diildren, and 
the expected confirmation of the ability of readiness tests to predict how well 
children will adilave after a year’s instruction. These two conclusions are 
supported by the sigitf leant F ratios for materials and categories respectively. 
Significant differences were found in the lew group between the Ginn material and 
Uppincott material, and between the Ginn material and Scott Foresman material. 

Ginn was considered as a control, and the direction of the differences was posi*!* 
tlve for Scott Foresman and negative for Uppincott. In other words, the Uppincott 
material did less well than the Ginn, whereas the Scott Foresman grottp did 
better (see Table VI (b)). 



All three F ratios for the SpelUng subtest are significant. These findings 
may be Interpreted as follows: With respect to the slgnlf leant category F ratio, 
the data again confirm the ability of the readiness tests to predict achievement. 
The significant materials F ratio, however, must be interpreted in ll^t of the 
significant interaction F ratio. In other words, there are certain groins at 
certain levels which perform better with certain of the materials. Which material 
fit with whldi groups is shown by Table VI (b). It may be observed from thU table 
that the significant differences all occur in the low group, and that the 1/t/a 
material is associated with the highest success, while the Stem material is 
elated with the lowest success. No significant differences appear among the high 
grotq»s. In reading this table, note that all comparisons are u^e with the Ginn 
Basic Beading Frogram, and also that the previously mentioned cautions about the 
i,/t/a version of the Stanford Achievement Tests apply here. 



jThe results of the analysis of the Word Study Skills subtest shov. no evidence 
of Interaction between eaterlals and levels. The significant results for both the 
high and low groups esrely confirm again the existence of significant differences 
for all children among the eight materials studied. 

The significant H's for the Word Beading analysis confirm again the differences 
among materials on this subtest for all children In the study. No precise probabll* 
Ity statement about Interaction Is possible here because the non-paramstric analysis 
of variance does not provide this Information. Inspection of the average rank orders 
however. Indicates that idille Ginn, 1/t/a, and Scott Foresman materials are likely 
to be most productive In the low group, 1/t/a, Scott Foresman, ai^ HcGraw-Hlll 
materials seem to fill this specification ih the high group. 

TABLE VI 



(a) Beading Achievement Categorised by High and Low Beading Beadiness Groups and Materi 





Paragraph 

Meaning 


Vocabulary 


Spelling 


Word 

Studr SldJLla 


Word 

»^**^^«* 






Bisk 






■ liflir.,- Hith 


t«» Blfh 


W 


JULsk. 


inn (bj) 


52.75 


22 


20 


29 


19 


24 


SO.O 30 


51.8 


36.65 


't/a (bj) 


63.7 


28 


16 


25 


24 


29 


60.4 44 


67.85 


65.40 


-ipp. (b.) 


31.75 


21 


15 


24 


17 


25 


32.0 31 


35.25 


40.40 


(b.) 


35.2 


21 


ee 


** 


** 


** 


33.2 30 


28.7 


44.2 


iVit (bj) 


30.6 


21 


17 


28 


14 


20 


30.55 2S 


25.66 


34p4 


lA (bj) 


18.0 


17 


20 


24 


15 


19 


20.8 18 


17.4 


16b35 


-F (b.) 


44.4 


25 


26 


30 


19 


27 


54.65 43 


49.6 


56.56 


':em (bg) 


29.65 


24 


17 


24 


12 


25 


24.0 31 


38.16 


36.55 



;ategorles 42.44* 

iterlals H*254.40* F"2.13 3.63* 

ateractlon * 77 

F Batlo 



81.76* 

11.42* 

2 . 22 * 
F Batlo 



H-255.13* P-4.90* H-263.40* 250. 

“.001 ■ **-32 



*Slgnif leant at or above the 5Z level. **Kot Included In this analysis. 
Mean grade equivalent score is recorded when F ratio Is given, and average 
rank order Is recorded when H Is given.. 




(b) Individual Comparison Among Materials by Beadiness Levels 

Vocabulary Spelling 

^Dunnett ^Dunnett 



Study 


Low 




b **b<» 
1 2 


1.97 


1.05 


“l ■’*3 


2.46* 


1.32 


bi-b5 


1.47 


.26 




0 


1.32 


“l "“7 


-2.95* 


- .26 


b, -b. 


1.47 


1.32 



Low High 



-3.90* 


—2.22 






1.56 


- .44 






3.90* 


1.77 


d.f. 


62 


3.12* 


2.22 


^.05 


- 2.35 


0 


- 1.33 






5.46* 


- ,44 
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An*lyl>_of Beading Adilevtamt C>tegorl«td bv S€x and bv Miterlal 



A fut^ar mutlyiu of the Stanford Achiavaaent acoraa for students participating 
in the first grade reading study was iiada by classifying scores according to tiia sax 
of the studant, and according to the natariaX irlth ufaldi they iiere taught* Again, 
the coaparison of interest was one of interaction betveen aateriaXs and the other 
classifying Tariable, sex* In other words, do boys perform better with certain 
materials than girls* In three of the subtests, at least one but not aore than two 
of the aaterial groiqpings had to be dropped because the data froa thea did not fit 
tile requireaents of this type of analysis* The results of the raaaining groups are 
shown in Table VII* Tte data in this tible show no significant interactions* It 
docs not appear to s^ce toy difference idiether the sex of the student is male or 
feaale in deteraining tiie probable level of success with these reading aaterlals* 

The significant F ratios for materials merely confirm again the fact tiiat for all 
students in the study, there are differences aaong the materials in rea^ng achieve- 
ment* The nonsignificant categories F ratios are especially interestisg because 
they are contrary to the usually eipected findings for boys and girls* It is cooi» 
monly assumed that girls learn to read aore easily in the first grade than boys do* 
Such a finding does hot emerge from tiie data of this study* It is also interesting 
to note that a previous stu^y in Livt.tiia| conducted with a different sample of 
first graders, reports the saae finding*^ Iftiile this situation mqr be accounted 
for in several ways, it appears possible that teachers in Livonia in the first 
grade are at least taking into account some of the differences between boys and girls 
Which effect reading instruction, and somehow counteracting them* An alternative 
possibility is supported by the fact that a preliminary analysis of the readiness 
scores, subdivided on the basis of achieven^t and material, shows no signf leant 
differences* In other words, it seems that tiie boys and girls in this study got 
about the same start* 

TABLE VII 

Beading Adilevement Categorised by Sex and by Haterlal 



Paragraph 



Vocabulary 



^peUlBA 



Word 

Study Skills 



Word 



Ginn (bp. 
i/t/a (bp 

Llpp* (bj) 

MeSHl (bp 
MBOLP (b^) 
SBA (bp 
3-F (b^) 
Stem (bg) 
Categories 
Materials 
Interaction 



Bovs Girls 


Boys 


Girls 


_Boas_. 


Girls 


Boys 


Girls 


Boys 


Girls 


20 


21 


20 


24 


21 


22 


26 


27 


20 


20 


18 


22 


21 


19 


24 


25 


36 


38 


26 


28 


** 


** 


19 


22 


19 


25 


19 


33 


18 


23 


21 


18 


27 


24 


22 


17 


30 


23 


23 


19 


16 


18 


21 


21 


14 


18 


20 . 

o ^ 


24 


16 


17 


16 


19 


25 


21 


17 


20 


ee 


** 


15 


18 


19 


20 


25 


22 


25 


24 


32 


29 


21 


20 


** 


*♦ 


24 


20 


17 


21 


19 


25 


e* 


** 


1.78 




*917 




2*61 




1*31 




*75 




1*73 




1*129 




4.41* 




3*38* 




7*26* 




1*06 




1*498 




1*48 




1*38 




1*24 




F Ratio 




F Ratio 




F Ratio 




F Ratio 




F Ratio 





♦S ignificant at or above the 5% level* **Wct included in the analysis* 

^Ipbodian, June J* “An Analysis of Certain Dimensions of Teacher Behavior 
Du^g Reading Instruction in the First Grade." Doctoral dissertation presented 
' at Kent State University Graduate School, March, 1966* 



10 - 






Analysig of bir 

j ^ 



Speech Pre i>lf«f hy Material 



Thie enelyele vae undertaken to detenine If any of the saterlala tried out 
in the first grade reading study seem to produce better achievement for children 
vith speedi problens. If soae one or nore of the materials was uniquely suited 
to such children, then interaction should appear between the categories of speech 
problem and no speech problem, and among the materials* Again, in this a 

preliminary analysis of levels, of readiness shows no differences among the grot^s 
before instruction began* 



Table VIII shows the result of this analysis* There are no significant inter- 
action F ratios* Therefore, no material appeared to be uniquely better than the 
others for children with or without speech problems* The significant F ratios for 
materials reflect, of course, the expected difference among all children for eight 
different approaches to reading instruction* These differences are expected as a 
result of the preliminary analysis of the total groups where such a finding was 
supported. The lack of significant results between categories of speedi problems 
and no speech problems is Interesting, because one would suspect that the existence 
of sp^di .problras mi^t be considered evidence of a relatively low level of develc^ 
ment of reding readiness skills, mad would, therefore, ei^ct lower scores for 
presenting speech probleme* There is a slight trend in the direction 
of lower scores in the presence of speech problems,, but it is neither consistent 
for all subtests and materials, nor great enough to be attributed to nca-chance 
factors* Since precision of measurement is alwiqrs a problem in studies of this 
kind, it would not be wise to conclude that speech difficulties create no problens 
in reading adiievenent* Further, stu^ of this phenomenon should be undertaken* 



TABUS VIII 



Beading Achievement Categorised by Speech and Uaterlal 



inn (bp 
-./t/a <b 2 ) 

^ipp. (bj) 

TcG-H (bp 
(bp 
. (b^) 

•?— F (b^)' 
./^ern (bp 

.ategorles 
..laterials 
ater action 



Paragraph 
Meaning. ■ 



Word 

VnriiKt.iM>y SnelUng Study Skills 



Vord 

Beading 



No spi 


Speech 


No sp* 


Speech 


No sp* 


Speech 


No sp* 


Speech. 


No sp* 


Spent 


Problem Problm Problem 


froi>l«n Probtow 


Probl^ ProhlfflL 


Problem Problem. Pjrobic 


20 


19 


24 


17 


23 


20 


28 


26 


21 


19 


22 


18 


22 


17 


25 


25 


31 


30 


26 


26 


29 


17 


23 


21 


24 


22 


27 


28 


22 


20 


20 


18 


26 


25 


19 


18 


27. 


23 


21 


19 


ee 


** 


22 


23 


17 


16 


25 


19 


18 


15 




e* 


22 


21 


** 


ee 


18 


18 


15 


16 


18 


18 


28 


26 


23 


22 


26 


29 


20 


18 


** 


e* 


20 


21 


16 


20 


19 


25 


15 


18 


3.13 




1*93 




.25 




.03* 




*95 




.39 




1.63 




4.23* 




2.31* 




7.00* 




*39 




,46 




.54 




.51 




.64 





^Significant at or abevs the 5% level* 
AANot included in the analysis*. 



An^iyuiiK of Readir r fiM^ ^gorlzed bv Doaiaan ce Pattern «nd by Hittrl<l- 

Ihe esti^lltbMent of patterns of dosinance In children Is both a function 
of developaent and of relatively penaanent individual differences. That char- 
acteristics of doBlnance can influence ability to learn to read has been estab- 
lished by previous* research. The purpose of- this analysis was, again, to 
discover whether sone one- or oore of the awterials was uniquely better suited to ; 
a pattern of mined dominance or a pattern of established dominance. The existence 
of a significant interaction should indicate the effect of sudi a material. Mixed 
dominance was considered to exist when a child was inconsistent in his use of 
his hands, eyes, legs, or other parts of the body in performing physical ta^. 

For example, a ^Id mi:^t write with his right hand, but use his left eye to 
look through a telescope. Because equality of development was ^parent: among 
the groups, as evidenced by no significant differences in reading readiness 
scores, the diildren with mixed dominahce were more likely those idio had' a 

Xess permanent sdLxed dominance pattern. These were rather few in number.- 
It was, therefore, not possible to find substantial samples of children with 
mixed dominance in all of the material groupings. This condition required the 
use, in some cases, of a non-parametric analysis whidi does not produce a precise 
estimate of interaction. Table IX shows the res: Its of this analysis. 



As e^q>ected, the differences among materials for all diildren were, in 
general, supported in each subtest except the Vocabulary section. Sample sire 
may easily account for the lack of significance here. The Vocabulary, Spelling, 
and Word Reading subtests could be precisely analyzed for interaction, and no 
significant results emerged. The existence of interaction between materials and 
type of dominance mi^t be inferred from an examination of the rank orders of 
f he averages for each material. For exaoqile, in the Paragraph Meaning s 
Scott Foresman, Stem, and i/t/a materials seem to be most effective for the 
mixed dominance groiqi, while Ginn^ Scott Foresman, McGraw-Hill, and 
most effective for the established dominance groups. The Word Study Sktl^ sub- 
test. like the Paragraph Meaning sub test, had to be treated by npn-parametrlc 
methods within the categories. Again, the examination of the rank orders ^ 
material averages shbws a different ordering of these averages. For the mixed 
group, i/t/a. Scott Forea»an, Stem, md Uppincott .wtetlaU appe«* 
the best achievement. For the established group, Scott Foresman, i/t/a, McGraw 
Hill, and Stem materials seem to do likewise. 

This is a very risky kind of Interpretation, however, and should be checked 
by further study. 



TABLE XX 



AchievMient Categorized by OoDiiience and by Haterial 

Paragri^b Word Word 

Vocabuliry Spelling Study Skilly . , , - 





Mixed 


Eatab- 

llahed 


Mixed 


Eatab- 

Uahed 


Mixed 


Eatab- 

Uabed 


Mixed 


Eatab- 

liahed 


Eatab- 
Mixed liahed 


Ginn 


36.4 


47.4 


23 


22 


20 


21 


20 


39.95 


19 20 


1/t/a (bj) 


40.15 


45.95. 


20 


20 


25 


24 


34 


51.6 


26 25 


Idpp. (b^) 


37.5 


38.4 


16 


25 


23 


22 


30 


37.25 


** ** 


HcG-H (b^) 


28.75 


46.2 


24 


26 


15 


20 


20 


44.05 


18 22 


HSDLP (b^) 


25.2 


34.0 


25 


22 


16 


18 


24 


30.4 


17 17 


SBA (b^) 


24.9 


32.35 


21 . 


24 


17 


20 


19 


25.9 


17 18 


SrP (by) 


44.8 


46.45 


28 


-j26 


25 


25 


34 


51.8 


21 20 


scam (bg) 


42.15 


33.25 


21 

\ 

.79 


24 


22 


20 


31 


43.05 


20 17 


Categorlea 








.62 










MaterlalaH*219.99* 248.06* 1.08 




3.80* 




F-12.66* 


H-253.33* F*4.96* 


Interaction 






.76 

F Ratio 




.56 

F Ratio 








.74 

F Ratio 



*.001 ■ 

*Slgnlflcant at or abdve the 3Z level. 

**Not Included In the oialyala. 

Hean grade equivalent acute ia recoded when t ratio ia given, and average 
rank order la recorded when H la given. 



Off/laaearch 

PBCsns 

11/66 




-.13- 
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